feat: enhance reward functions with optimum-aware options and add hyb… by Grzmro · Pull Request #10 · helix-agh/DynamicAlgorithmSelection

Grzmro · 2026-06-04T06:59:14Z

No description provided.

…rid reward strategies

wniec · 2026-06-05T12:45:40Z


 import numpy as np

+_GAP_FLOOR = 1e-8  # BBOB precision target: gaps below this count as "solved".


Nice cap. It may interfere with the AOCC computation. Double-check that fitness isn't clipped twice there.
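
To make the double-clipping concern concrete, here is a minimal sketch. The `aocc` helper and its bounds are assumptions for illustration only (the repository's AOCC code is not shown in this diff). It suggests that a second clip is harmless when both places use the same floor, but changes the score when the AOCC lower bound is below `_GAP_FLOOR`.

```python
import numpy as np

_GAP_FLOOR = 1e-8  # same constant as in the diff above


def aocc(gaps, lower=1e-8, upper=1e2):
    """Hypothetical AOCC: mean of (1 - normalized log10 gap), gaps clipped to [lower, upper]."""
    logs = np.log10(np.clip(gaps, lower, upper))
    lo, hi = np.log10(lower), np.log10(upper)
    return float(np.mean(1.0 - (logs - lo) / (hi - lo)))


gaps = np.array([1e1, 1e-3, 1e-9, 1e-12])   # made-up best-so-far gaps to the optimum
pre_clipped = np.maximum(gaps, _GAP_FLOOR)  # what an earlier clip in the reward code would produce

# Same floor in both places: the second clip is a no-op.
same = (aocc(gaps, lower=1e-8), aocc(pre_clipped, lower=1e-8))

# AOCC lower bound below _GAP_FLOOR: pre-clipping hides progress and lowers the score.
diff = (aocc(gaps, lower=1e-12), aocc(pre_clipped, lower=1e-12))
```

If the two floors are meant to coincide, sharing one constant between the reward module and the AOCC code would keep them in sync.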

wniec · 2026-06-05T12:47:20Z



-def reward_binary(new_best_y, old_best_y, initial_range, is_final=False):
+def reward_binary(new_best_y, old_best_y, initial_range, is_final=False, optimum=None):


It would be nice to have unit tests for all of these reward definitions.
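
A sketch of what such tests could look like, in pytest style. The body of `reward_binary` is not visible in this hunk, so the stand-in below is an assumption (1.0 on improvement, 0.0 otherwise, with a gap at or below `_GAP_FLOOR` treated as success when `optimum` is given). Only the signature comes from the diff.

```python
import math

_GAP_FLOOR = 1e-8


def reward_binary(new_best_y, old_best_y, initial_range, is_final=False, optimum=None):
    """Stand-in with assumed behaviour; replace with the real import in the test module."""
    if optimum is not None and new_best_y - optimum <= _GAP_FLOOR:
        return 1.0  # assumed: reaching the precision target counts as success
    return 1.0 if new_best_y < old_best_y else 0.0


def test_improvement_is_rewarded():
    assert reward_binary(1.0, 2.0, 10.0) == 1.0


def test_no_improvement_gives_zero():
    assert reward_binary(2.0, 2.0, 10.0) == 0.0


def test_first_step_from_inf_is_finite():
    # old_best_y starts at inf before the first evaluation
    assert math.isfinite(reward_binary(5.0, float("inf"), 10.0))


def test_default_optimum_keeps_old_call_signature():
    # Callers that never pass `optimum` should see unchanged results.
    assert reward_binary(1.0, 2.0, 10.0) == reward_binary(1.0, 2.0, 10.0, optimum=None)


def test_solved_gap_with_optimum():
    assert reward_binary(1e-9, 1e-9, 10.0, optimum=0.0) == 1.0
```

The same cases (improvement, stagnation, infinite starting value, default `optimum`, solved gap) could be parametrized over every reward function in the module.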

wniec · 2026-06-05T12:54:45Z

+
+def reward_log_scaled(new_best_y, old_best_y, initial_range, is_final=False, optimum=None):
    """Log-scaled incremental improvement (original r1)."""
-    if old_best_y == float("inf"):


The logarithm in the first reward was added to avoid reward hacking. In general, rewards that do not take the global optimum into account are hard to protect against hacking. I think it's also important to keep in mind that inserting the global minimum into the reward makes the meta-BBO task significantly easier. It would be nice to compare global-optimum-aware rewards with each other, but not necessarily with the ones that do not take the GO into account.
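
As an illustration of why optimum-aware rewards form their own comparison class, here is a minimal sketch of one possible gap-based log reward. `reward_log_gap` is a hypothetical name and not one of the PR's functions. Because each step rewards the decrease in log10 gap, the per-step rewards telescope: the episode total depends only on the first and last gap, which leaves little room to game it by splitting or reordering improvements.

```python
import math

_GAP_FLOOR = 1e-8  # same constant as in the diff above


def reward_log_gap(new_best_y, old_best_y, optimum):
    """Hypothetical optimum-aware reward: decrease in log10 gap to the known optimum."""
    old_gap = max(old_best_y - optimum, _GAP_FLOOR)
    new_gap = max(new_best_y - optimum, _GAP_FLOOR)
    return math.log10(old_gap) - math.log10(new_gap)


# Made-up best-so-far trajectory for a function whose optimum is 0.0.
trajectory = [100.0, 10.0, 0.5, 1e-3, 1e-10]
rewards = [
    reward_log_gap(new, old, optimum=0.0)
    for old, new in zip(trajectory, trajectory[1:])
]

# Telescoping sum: log10(100) - log10(_GAP_FLOOR) = 2 - (-8) = 10
total = sum(rewards)
```

A reward without access to the optimum cannot express this gap directly, which is one reason to benchmark the two families separately.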

feat: enhance reward functions with optimum-aware options and add hyb…

9fbef34

…rid reward strategies

Grzmro requested a review from wniec June 4, 2026 06:59

Improvements to the reward_hybrid_sign function

4c82268

wniec reviewed Jun 5, 2026


fix normalization and tests

6db1e02

Grzmro wants to merge 3 commits into DAS2 from reward


3 participants

